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(54) Document processing apparatus and method 



(57) A document processing apparatus (1) has a 
language tag setting section (15) for setting a language 
tag designating a kind of language at each of constituent 
unit positions in an original and those in a translated ver- 
sion corresponding to the original, and a document 



forming section (15) for forming a tagged document in- 
cluding the original and the translated version each hav- 
ing the language tag set therein. This apparatus enables 
a user to easily form and maintain a home page (Web 
page data) expressed In a plurality of languages. 
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Description 

BACKGROUND OF THE INVENTION 

[0001] The present invention relates to a document 
processing apparatus and method which analyzes a 
tagged document, e.g., a Hyper Text Markup Language 
(HTML) document and forms another tagged document 
containing original sentences and translated sentences. 
The present invention also relates to a recording medi- 
um for recording such a tagged document. 
[0002] With the recent proliferation of personal com- 
puters and communication apparatus, people have be- 
come able to use communication networks represented 
by the Internet, i.e., Internet protocol (IP) communica- 
tion networks, and to easily obtain various sorts of infor- 
mation through the networks. Ordinarily, the World Wide 
Web in the Internet generally uses HTML as a language 
for describing information. Dynamic HTML (DHTML) 
and Extensible Markup Language (XML) are other lan- 
guages presently used to form tagged documents. 
[0003] Conventionally, to form home pages (also 
called Web pages) containing the same information de- 
scribed in different languages, e.g., English and Japa- 
nese, the process of separately forming each home 
page is required. That is, thesteps of forming sentences 
separately in each language, pasting common images 
including graphs and figures, separately setting links 
from each of the English home page and the Japanese 
home page, etc., are required. 
[0004] Home pages are open to the public on the In- 
ternet and can be read by people everywhere in the 
world. Therefore, people who set up their home pages 
for various readers to read are making home pages hav- 
ing the same formal and contents but having sentences 
written in different languages. 

[0005] The English and Japanese home pages 
formed as described above need scrupulous attention 
for maintenance because their English and Japanese 
sentences after correction must be equivalent in mean- 
ing and format, 

[0006] To form such English and Japanese home pag- 
es, it is necessary to make and manage two kinds of 
HTML document files for the English home page and 
the Japanese home page. That is, the number of files 
to be managed is increased in proportion to the number 
of languages, and management and maintenance be- 
comes difficult. 

[0007] A user who wishes to read document informa- 
tion in an English home page in Japanese may translate 
the English document information by using Internet 
translation software. However, if the user wishes to edit 
the results of the translation displayed as an HTML doc- 
ument, he or she must give up the idea of doing so or is 
further required to translate the translation results by dif- 
ferent translation software, because direct editing of the 
translated HTML document is impossible. 
[0008] If the user dares to edit the translation-result 
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HTML document, he or she must perform the steps of 
storing the translation-result HTML document on a local 
disk, opening the HTML document file stored on the lo- 
cal disk by using HTML document editing software, dis- 

5 playing the HTML document source, directly editing the 
HTML document source, and storing the results of the 
editing on the local disk. This process enables editing 
of the translation results to some effect. However, it is 
difficult to edit a document in which HTML tags, original 

10 sentences, and translated sentences are mixed. 

[0009] Further, in a case where an HTML document 
intended as an object of translation is prepared in ad- 
vance and, from this document, another HTML docu- 
ment described in a different language is formed by 

'5 translation processing using Internet translation soft- 
ware, a need may arise to edit the HTML document in 
the second language formed by the translation process- 
ing and, if necessary, the translation-object HTML doc- 
ument, if the author of the HTML document is notsatis- 

20 fied with the results of the translation. 

[0010] In this editing, it is difficult to determine docu- 
ment portions to be edited and to confirm the corre- 
spondence between original and translated sentences, 
since the translation-object HTML document and the 

25 translation-result HTML document exist in separate 
files. It is also possible that, through editing, the page 
configuration (format) of one document will become dif- 
ferent from that of the other. 

[0011] As described above, the conventional HTML 
so document processing apparatus can be designed to en- 
able translation of an original home page on the Internet 
using Internet translation software and visual display of 
original and translated sentences in a juxtaposed form. 
However, in editing translation results, an HTML docu- 
35 ment itself cannot be edited. There is a way to directly 
edit the HTML document source, but editing in such a 
way is extremely troublesome and not satisfactorily ef- 
fective, 

40 SUMMARY OF THE INVENTION 

[0012] In view of the above-described circumstances, 
an object of the present invention is to provide a docu- 
ment processing apparatus and a recording medium 
45 which make it possible to easily form and maintain a 
home page (Web page data) expressed in two or more 
languages. 

[0013] Another object of the present invention is to 
provide a document processing apparatus and a record- 
so ing medium which make it possible to easily edit trans- 
lated sentences obtained as a result of translation of a 
tagged document. 

[0014] Still another object of the present invention is 
to provide a document processing apparatus and a re- 
55 cording medium which make it possible to selectively 
display original sentences in a tagged document and 
translated sentences obtained as a result of translation 
of the tagged document. 
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[0015] A further object of the present invention is to 
provide a document processing apparatus and a record- 
ing medium which make it possible to return the edited 
translation sentences of the result of translation into a 
tagged document and to use it. 
[0016] A further object of the present invention is to 
provide a document processing apparaLus and a record- 
ing medium which make it possible to easily edit original 
sentences in a tagged document to from a more favo- 
rable translation. 

[0017] To achieve the above-described objects, ac- 
cording to one aspect of the present invention, there is 
provided a document processing apparatus comprising 
language tag setting means for setting a language tag 
designating a kind of language at each of constituent 
unit positions in an original and those in a translated ver- 
sion corresponding to the original, and document form- 
ing means for forming a tagged document including the 
original and the translated version each having the lan- 
guage tag set therein. 

[0018] The above-described document forming 
means may form a tagged document in which the orig- 
inal and the translated version each having the lan- 
guage tag set therein are described in an original-vcr- 
sus-version form. 

[0019] The above-described document processing 
apparatus further comprises translation means for 
translating the original to make the corresponding trans- 
lated version. 

[0020] The original may be contained in the process- 
ing-object tagged document. 

[0021] The above-described document processing 
apparatus further comprises visibility control tag setting 
means for setting a visibility control tag for controlling 
any document portion so that the document portion is 
invisible, the visibility control tag setting means setting 
the visibility control tag at such a position, that one of 
the original and the translated version is in an invisible 

[0022] The visibility control tag setting means may sot 
the visibility control tag at such a position that one of the 
original and the translated version is set in an invisible 
state and the language tag is also set in an invisible 
state. 

[0023] The above-described document processing 
apparatus further comprises display processing means 
for interpreting the tag contained in the tagged docu- 
ment and for enabling a browser for displaying the 
tagged document to display the formed tagged docu- 
ment on the basis of a result of the interpretation in the 
above document processing apparatus. 
[0024] According to another aspect of the present in- 
vention, there is provided a document processing appa- 
ratus comprising language tag setting means for setting 
a language tag at each of constituent unit positions in a 
first text described in a first language and those in a sec- 
ond text described in a second language of the first text, 
and document forming means forforming a tagged doc- 



ument including the first text and the second text each 
having the language tag set therein. 
[0025] The above-described document forming 
means may form a tagged document in which the first 
s text and the corresponding second text each having the 
language tag set therein are described by being related 
to each other. 

[0026] The above-described document processing 
apparatus further comprises visibility control tag setting 

10 means for setting a visibility control tag for controlling a 
text so that the text is invisible, the visibility control tag 
setting means selling the visibilily control tag at such a 
position that one of the first text and the second text is 
in an invisible state. 

15 [0027] Thevisibilitytag setting means may setthe vis- 
ibility control tag at such a position that one of the first 
text and the second text is set in an invisible state and 
the language tag is also set in an invisible state. 
[0028] The above-described document processing 

20 apparatus further comprises display processing means 
for interpreting the tag contained in the tagged docu- 
ment and for enabling a browser for displaying the 
tagged document to display the formed tagged docu- 
ment on the basis of a result of the interpretation. 

25 [0029] According to still another aspect of the present 
invention, there is provided a document processing ap- 
paratus comprising analysis means for determining and 
extracting an original text from a processing-object 
tagged document, and editing means for enabling edit 

30 processing of the original text extracted from the tagged 
document by displaying the original text. 
[0030] The analysis means may determine the origi- 
nal text according to a language tag contained in the 
processing-object tagged document. 

35 [0031] The above-described document processing 
apparatus further comprises translation processing 
means for making a translated version by translating the 
original text edited by the editing means. 
[0032] The above-described document processing 

40 apparatus further comprises restoration means for re- 
storing, from the original text after the editing and the 
translated version made by the translation processing 
means, a tagged document in the same format as the 
processing-object tagged document according to tags 

45 contained in the processing-object tagged document, 
the tags including the language tag. 
[0033] According to yet another aspect of the present 
invention, there is provided a document processing ap- 
paratus comprising analysis means for determining an 

so original text and a translated version corresponding to 
the original text in a processing-object tagged docu- 
ment, and editing means for enabling edit processing of 
the original text and the translated version by extracting 
the original text and the translated version from the 

55 tagged document and by displaying the original text and 
the translated version in an original-versus-version 
form. 

[0034] The analysis means may determine the origi- 
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nal text and the translated version according to lan- 
guage tags contained in the processing-object tagged 
document. 

[0035] The above-described document processing 
apparatus further comprises translation processing 
means for making a translated version by translating the 
original text edited by the editing means. 
[0036] The above-described document processing 
apparatus further comprises restoration means for re- 
storing, from the original text after the editing and the 
translated version made by the translation processing 
means, a tagged document in the same format as the 
processing-object tagged document according to tags 
contained in the processing-object tagged document, 
the tags including the language tag. 
[0037] According to a further aspect of the present in- 
vention, there is provided a recording medium readable 
by a computer, the recording medium having a program 
recorded thereon, the program for enabling the compu- 
ter to execute a step of setting a language tag designat- 
ing a kind of language at each of constituent unit posi- 
tions in an original and those in a translated version cor- 
responding to the original, and a step of forming a 
tagged document including the original and the translat- 
ed version each having the language tag set therein. 
[0038] The program recorded on this recording medi- 
um may enable the computer to further execute a step 
of setting a visibility control tag for controlling any doc- 
ument portion so that the document portion is invisible, 
the visibility control tag being set at such a position that 
one of the original and the translated version is in an 
invisible state. 

[0039] According to still afurther aspect of the present 
invention, there is provided a recording medium reada- 
ble by a computer, the recording medium having a pro- 
gram recorded thereon, the program for enabling the 
computer to execute a step of setting a language tag 
designating a kind of language at each of constituent 
unit positions in a first text described in a first language 
and those in a second text described in a second lan- 
guage of the first text, and a step of forming a tagged 
document including the first text and the second text 
each having the language tag set therein. 
[0040] The program recorded on this recording medi- 
um may enable the computer to further execute a step 
of setting a visibility control tag for controlling a text so 
that the document is invisible, the visibility control tag 
being set at such a position that one of the first text and 
the second text is in an invisible state. 
[0041 ] According to still a further aspect of the present 
invention, there is provided a recording medium reada- 
ble by a computer, the recording medium having a pro- 
gram recorded thereon, the program for enabling the 
computer to execute a step of determining and extract- 
ing an original text from a processing-object tagged doc- 
ument, and a step of enabling editing of the text extract- 
ed from the tagged document by displaying the original 
text. 



[0042] The program recorded on this recording medi- 
um may enable the computer to further execute a step 
of making a translated version by translating the original 
text after editing of the original text, and a step of restor- 

s ing, from the original text after the editing and the trans- 
lated version made by the translation, a tagged docu- 
ment in the same format as the processing-object 
tagged document according to tags contained in the 
processing-object tagged document, the tags including 

10 a language tag designating a kind of language. 

[0043] According to still afurtheraspect of the present 
invention, there is provided a recording medium reada- 
ble by a computer, the recording medium having a pro- 
gram recorded thereon, the program for enabling the 

15 computer to execute a step of determining an original 
text and a translated version corresponding to the orig- 
inal text in a processing-object tagged document, and a 
step of enabling editing of the original text and the trans- 
lated version by extracting the original text and thetrans- 

2° lated version from the tagged document and by display- 
ing the original text and the translated version in an orig- 
inal-versus-version form. 

[0044] The program recorded on this recording medi- 
um may enable the computer to further execute a step 

25 of forming a translated version by translating the original 
text after editing of the original text, and a step of restor- 
ing, from the original text after the editing and the trans- 
lated version made by the translation, a tagged docu- 
ment in the same format as the processing-object 

so tagged document according to tags contained in the 
processing-object tagged document, the tags including 
a language tag designating a kind of language. 
[0045] Accordingto still afurther aspect of the present 
invention, there is provided a recording medium reada- 

35 ble by a computer, the recording medium having a 
tagged document recorded thereon, the recorded 
tagged document comprising a first language tag des- 
ignating a first kind of language; a first text following the 
first language tag, the first text being described in a first 

40 language; a second language tag following the first text, 
the second language tag designating a second kind of 
language; and a second text following the second lan- 
guage tag, the second text being described in a second 
language and corresponding to the first text. 

45 [0046] Accordingtostillafurtheraspectofthepresent 
invention, there is provided a recording medium reada- 
ble by a computer the recording medium having a 
tagged document recorded thereon, the recorded 
tagged document comprising an invisibility start tag for 

50 setting a following text in an invisible state; a first lan- 
guage tag following the invisibility start tag, the first lan- 
guage tag designating a first kind of language; a first 
text following the first language tag, the first text being 
described in a first language; a second language tag fol- 

55 lowing the first text, the second language tag designat- 
ing a second kind of language; an invisibility end tag fol- 
lowing the second language tag, the invisibility end tag 
canceling the invisible state; and a second text following 
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the invisibility end tag, the second text being described 
in a second language and corresponding to thefirst text. 
[0047] According to a further aspect of the present in- 
vention, there is provided a document processing meth- 
od comprising a step of setting a language tag desig- s 
nating a kind of language at each of constituent unit po- 
sitions in an original and those in a translated version 
corresponding to the original; and a step of forming a 
tagged document including the original and the translat- 
ed version each having the language tag set therein. io 
[0048] The above-described document processing 
method further comprises a step of setting a visibility 
control tag for controlling any document portion so that 
the document portion is invisible, said visibility control 
tag being set at such a position that one of the original 15 
and the translated version is in an invisible state. 
[0049] According to still afurther aspect of the present 
invention, there is provided a document processing 
method comprising a step of setting a language tag des- 
ignating a kind of language at each of constituent unit 20 
positions in a first text described in a first language and 
those in a second text described in a second language 
of the first text; and a step of forming a tagged document 
Including the first text and the second text each having 
the language tag set therein. 25 
[0050] The above-described document processing 
method further comprises a step of setting a visibility 
control tag for controlling a text so that the document is 
invisible, said visibility control tag being set at such a 
position that one of the first text and the second text is so 
in an invisible state. 

[0051 ] Acco rdi ng to stil I a f u rth er aspect of th e present 
invention, there is provided a document processing 
method comprising a step of determining and extracting 
an original text from a processing-object tagged docu- 35 
ment; and a step of enabling editing of the original text 
extracted from the tagged document by displaying the 
original text. 

[0052] The above-described document processing 
method further comprises a step of making a translated 40 
version by translating the original text after editing of the 
original text; and a step of restoring, from the original 
text after the edit processing and the translated version 
formed by said translation, a tagged document in the 
sameformatastheprocessing-objecttagged document 45 
according to tags contained in the processing-object 
tagged document, said tags including a language tag 
designating a kind of language. 
[0053] Accordi ng to stil I a further aspect of the present 
invention, there is provided a document processing so 
method comprising a step of determining an original text 
and a translated version corresponding to the original 
text in a processing-object tagged document; and a step 
of enabling edit processing of the original text and the 
translated version by extracting the original text and the 55 
translated version from the tagged document and by dis- 
playing the original text and the translated version in an 
original-versus-version form, 
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[0054] The above-described document processing 
method further comprises a step of forming a translated 
version bytranslatingthe original text after editingofthe 
original text; and a step of restoring, from the original 
text after the edit processing and the translated version 
made by said translation, a tagged document in the 
same format as the processing-object tagged docu- 
ment, according to tags contained in the processing-ob- 
ject tagged document, said tags including a language 
tag designating a kind of language. 
[0055] According to the present invention, it is not 
necessary to form and manage a file with respect to 
each of aplurality of languages, so that the maintenance 
can be easily carried out. 

[0056] According to the present invention, an original 
text and a translated version of the original text obtained 
by translation are displayed in an original-versus-ver- 
sion form such that edit processing and version can be 
easily carried out. 

[0057] According to the present invention, an original 
text in a tagged document and a translated version of 
the original text obtained by translation can be selective- 
ly displayed. 

[0058] Further, according to the present invention, an 
edited translated version of the result of translation can 
be reformatted to the tagged document and used. 
[0059] Further, according to the present invention, an 
original text in a tagged document can be easily edited 
to obtain a more favorable translated version. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0060] These objects and advantages of the present 
invention will become more apparent and more readily 
appreciated from the following detailed description of 
the presently preferred exemplary embodiments, taken 
in conjunction with the accompanying drawings of 
which: 

Fig. 1 is a block diagram showing the configuration 
of an HTML document processing apparatus which 
represents a first embodiment of the present inven- 
tion; 

Fig 2 is a diagram schematically showing the for- 
mat and contents of an HTML document; 
Fig. 3 is a diagram explaining a state where HTML 
documents are combined into one file and stored, 
and an edited state by a translation editor; 
Fig. 4 is a flowchart for explaining the operation of 
the HTML document processing apparatus of the 
first embodiment; 

Fig. 5 is another flowchart for explaining the opera- 
tion of the HTML document processing apparatus 
of the first embodiment; 

Fig. 6 is a diagram showing a specific example of 
an ohginal-versus-version HTML document source; 
Fig. 7 is a block diagram showing the configuration 
of an HTML document processing apparatus which 



5 



9 



EP 1 139 231 A1 



10 



represents second and third embodiments of the 
present invention; 

Fig. 8 is a flowchart for explaining the operation of 
the HTML document processing apparatus of the 
second embodiment; 

Fig. 9 is another flowchart for explaining the opera- 
tion of the HTML document processing apparatus 
of the second embodiment; 
Fig. 1 0 is a diagram showing an example of an initial 
window of an English-Japanese translation editor 
(original-versus-version editor program) ; 
Fig. 11 is a flowchart for explaining the operation of 
the HTML document processing apparatus of the 
third embodiment; and 

Fig. 12 is another flowchart for explaining the oper- 
ation of the HTML document processing apparatus 
of the third embodiment. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 

[0061] Embodiments of the present invention will be 
described with reference to the accompanying draw- 
ings. 

[First Embodiment] 

[Configuration of HTML Document Processing 
Apparatus] 

[0062] Fig. 1 shows functional components of an 
HTML document processing apparatus 1 which repre- 
sents a first embodiment of the present invention. As 
shown in Fig. 1 , the HTML document processing appa- 
ratus 1 is connected to a World Wide Web (WWW) serv- 
er 3 through an IP communication network, i.e., Internet 
2. In this embodiment, the WWW server 3 holds English 
home pages (Web pages in which contents are de- 
scribed in English). 

[0063] The HTML document processing apparatus 1 
can be realized by a personal computer, and has an in- 
put section 10, a display section 11 , a display processing 
section 12, a communication control section 13, an au- 
tomatic translation processing section 14, an original- 
versus-version HTML document forming processing 
section 15, an individual-language HTML document 
conversion processing section 16, a data storage sec- 
tion 17, etc. The HTML document processing apparatus 
1 is connected to a WWW browser 4 provided as an In- 
ternet display tool. 

[0064] The personal computer has, as is well known, 
a central processing unit (CPU), a random access mem- 
ory (RAM), a hard disk, a drive unit for accessing a port- 
able recording medium, such as a floppy disk or a com- 
pact disk-read only memory (CD-ROM), to read out a 
program or data recorded thereon, a communication 
control unit, such as a modem or a local area network 
(LAN) board, a display, a keyboard, and a mouse. 



[0065] The input section 10 of the HTML document 
processing apparatus 1 in this embodiment corre- 
sponds to the keyboard and the mouse; the display sec- 
tion 11 to the display; the data storage section 17 to the 
s RAM, the hard disk or the portable recording medium; 
and the communication control section 13 to the com- 
munication control unit. 

[0066] The display processing section 12, the auto- 
matic translation processing section 1 4, the original-ver- 

10 sus-version HTML document forming processing sec- 
tion 15, and the individual-language HTML document 
conversion processing section 16 are realized by mak- 
ing the CPU execute a program of the present invention 
and a program of Internet translation software. 

15 [0067] The program of the present invention and the 
Internet translation software are provided by being re- 
corded on the portable recording medium, and are read 
out by the drive unit and stored in the hard disk in ad- 
vance. 

20 [0068] The above-described program and software 
stored in an external unit, may be downloaded to the 
HTML document processing apparatus 1 through the 
network using the communication control unit, and 
stored in the hard disk. 

25 [0069] The program of the present invention and the 
Internet translation software stored in the hard disk are 
read to the RAM to be executed by the CPU. The pro- 
grams of the present invention and the Internet transla- 
tion software may be directly read to the RAM without 

so being stored in the hard disk, instead of being temporal- 
ity stored in the hard disk and thereafter read to the RAM 
as described above. 

[0070] HTML documents formed by the HTML docu- 
ment processing apparatus 1 in this embodiment of the 

35 present invention are stored in the RAM, the hard disk 
or the portable recording medium. 
[0071] In the HTML document processing apparatus 
1 arranged as described above, the automatic transla- 
tion processing section 14 translates an English HTML 

40 document (an original including sentences and figures) 
of an English home page described in English into a Jap- 
anese HTML document (a translated version) described 
in Japanese by using the home page translation func- 
tions of the Internet translation software, e.g., "ATLAS" 

45 (trademark of FUJITSU LIMITED), and the original and 
the results of the translation are visually displayed so 
that pairs of original and translated paragraphs or sen- 
tences are juxtaposed in upper and lower positions, as 
described below in detail, 

so [0072] In this translation and display process, the orig- 
inal-versus-version HTML document forming process- 
ing section 15 attaches a language tag, which is a spe- 
cial tag for identification of the kind of language, to the 
beginning of each of paragraphs or sentences which are 

55 constituent units of the original document and the trans- 
lated document. The individual-language HTML docu- 
ment conversion processing section 1 6 recognizes the 
attached language tag to enable selective display of on- 
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ly one of the original and translated documents by fur- 
ther adding an invisibility tag (visibility control tag) to in- 
struct to make no visual display. 

[Operation of HTML Document Processing Apparatus] 

[0073] The operation of the HTML document process- 
ing apparatus 1 in the first embodiment of the present 
invention shown in Fig. 1 will now be described. Fig. 2 
schematically shows the compositions and contents of 
original and translated documents and a state where 
these documents are stored and displayed by being jux- 
taposed in the above-mentioned manner. Fig. 3 shows 
a state where an original HTML document and aversion 
of the HTML document are stored and displayed while 
being combined into one file in such a form that each of 
pairs of original and translated sentences are juxta- 
posed in left and right positions. Figs. 4 and 5 are flow- 
charts for explaining the operation of the HTML docu- 
ment processing apparatus 1 . 

(Original-versus-version HTML Document Forming 
Processing) 

[0074] The operation will be described by way of ex- 
ample with respect to a case where, in the HTML doc- 
ument processing apparatus 1 , an English HTML doc- 
ument (original) of an English home page described in 
English is translated into a Japanese HTML document 
(translated version) described in Japanese, and an orig- 
inal-versus-version HTML document is formed from the 
original and version and is visually displayed. 
[0075] Referring also to Figs. 1 through 5, a user using 
the HTML document processing apparatus 1 searches 
the Internet for a home page described in a language 
which he or she wishes to translate, by operating the 
input section 1 0 and based on the contents of a display 
made by the display section 11 (WWW browser). In this 
example, the object of this search (processing step 
S401 in Fig. 4) is an English home page held by the 
WWW server 3. 

[0076] Next, the user initiates a program for transla- 
tion in the desired direction by operating the input sec- 
tion 1 0 and based on the contents of the display made 
by the display section 1 1 . In this example, the user wish- 
es to translate an English HTML document of the Eng- 
lish home page into a Japanese HTML document de- 
scribed in Japanese and therefore initiates an English- 
into-Japanese translation program (S402). 
[0077] The user clicks a "translation" button displayed 
in an initial window (dialogue window) (not shown) of the 
translation program to make the HTML document 
processing apparatus 1 translate and arrange the Eng- 
lish HTML document in such a mannerthat each of pairs 
of original and translated paragraphs or sentences can 
be displayed by being juxtaposed in upper and lower 
positions (S403). According to the user's "translation" 
button clicking operation, the automatic translation 



processing section 14 having the translation program 
read out from the hard disk, translates the English HTML 
document supplied from the WWW server 3 through the 
communication control section 1 3 into a Japanese doc- 
s ument, and thereby forms an HTML document in which 
each of a certain number of English paragraphs or sen- 
tences are followed by a line break tag and a corre- 
sponding Japanese paragraph or sentence, that is, 
each of pairs of original and translated paragraphs or 
10 sentences are displayed by being juxtaposed in upper 
and lower positions. Known techniques can be used for 
the process of translation in the automatic translation 
processing section 14, and therefore such translation 
techniques will not be described in detail in this specie- 
's cation. 

[0078] The original-versus-version HTML document 
forming processing section 15, supplied with the results 
of translation performed by the automatic translation 
section 14, i.e., the HTML document containing Japa- 

20 nese sentences, starts analyzing the HTML document 
(S404) to add language tags for enabling discrimination 
between the original and translated paragraphs or sen- 
tences in the individual-language HTML document con- 
version processing section 1 6 described below in detail. 

25 [0079] The original-versus-version HTML document 
forming processing section 15 first recognizes the part 
of a text of the inputted HTML document. The original- 
versus-version HTML document forming processing 
section 15 detects an HTML tag such as specifically 

30 shown in Fig. 6 (shown in the space between symbols 
< and > in Fig. 6) to recognize the section following this 
tag as English texts and Japanese texts (S405). 
[0080] Next, the original-versus-version HTML docu- 
ment forming processing section 15 determines, if the 

35 texts recognized in step S405 are an English paragraph 
or sentence (original) or a Japanese paragraph or sen- 
tence (translated version) according to the position of 
the paragraphs or sentences (in the order of an English 
paragraph orsentence, a line breaktag, anda Japanese 

40 paragraph or sentence) (step S406). If the original-ver- 
sus-version HTML document forming processing sec- 
tion 15 recognizes an English paragraph orsentence as 
a result of this determination, it embeds a language tag 
<E> at the beginning of the document-component-unit 

45 paragraph or sentence (S407). If it recognizes a Japa- 
nese paragraph or sentence, it embeds a language tag 
<J> at the beginning of the document-component-unit 
paragraph orsentence (S408). 
[0081] The original-versus-version HTML document 

so forming processing section 15 proceeds to processing 
of the next paragraph in the original-versus-version 
HTML document, and makes a determination as to 
whether an HTML tag </html> indicating the end of the 
HTML document is detected or not (steps S409, S410). 

55 The original-versus-version HTML document forming 
processing section 15 repeats the above-described 
steps S404 to S41 0 until the HTML tag indicating the 
end of the HTML document is detected. 
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[0082] The original-versus-version HTML document 
forming processing section 1 5 stores in the data storage 
section 1 7 an original-versus-version HTML document 
obtained as a result of the original-versus-version HTML 
document forming processing based on the above-de- 
scribed sequence of document analysis steps, when the 
HTML tag indicating the end of the HTML document is 
detected. The state of data thereby stored is shown as 
an original-versus-version HTML document source 21 
in Fig. 2. In Fig. 2, symbols A, and V indicate English 
sentences and symbols O, and ©indicate Japanese 
sentences. 

[0083] After the original-versus-version HTML docu- 
ment forming processing performed by the original-ver- 
sus-version HTML document forming processing sec- 
tion 15 based on the above-described sequence of doc- 
ument analysis steps, the display processing section 1 2 
visually displays the stored original-versus-version 
HTML document in the upper-lower juxtaposition form 
on the screen of the display section 11 (WWW browser 
4) (step S411). 

[0084] The original-versus-version HTML document 
source 21 is displayed in the upper-lower juxtaposition 
form, as indicated by reference numerals 20 in Fig. 2. 
In Fig. 2, "imagel " and "image2" represent figures in the 
document. 

(Processing for Selectively Displaying English HTML 
Document (Original) and Japanese HTML Document 
(version)) 

[0085] In the HTML document processing apparatus, 
processing described below is performed after the orig- 
inal-versus-version HTML document forming process- 
ing based on the above-described sequence of docu- 
ment analysis steps. The processing described below 
enables selective display of only one of the English 
HTML document (original) and the Japanese HTML 
document (translated version) displayed in the upper- 
lower juxtaposition form on the screen of the display 
section 11. 

[0086] The user determines display of the document 
in the particular language for selective display of the 
document, and clicks a button corresponding to the lan- 
guage to be displayed (a "Japanese" button in this ex- 
ample) (S412 in Fig. 5). In response to the user's "Jap- 
anese" button clicking operation, the HTML document 
processing apparatus 1 hands over processing to the 
individual-language HTML document conversion 
processing section 16 to enable selective display of only 
the English HTML document (original) or the Japanese 
HTML document (translated version). In the case where 
the objective languages are English and Japanese as 
in this example, "English" and "Japanese" buttons (not 
shown) are displayed. 

[0087] The individual-language HTML document con- 
version processing section 1 6 reads out the original-ver- 
sus-version HTML document source 21 to the data stor- 
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age section 1 7 as shown in Fig. 2, and starts analyzing 
the document (S413). The individual-language HTML 
document conversion processing section 16 makes a 
determination as to whether or not a language tag <E> 

s or <J> embedded in the original-versus-version HTML 
document source 21 has been detected (S414). 
[0088] If the individual-language HTML document 
conversion processing section 16 has detected a lan- 
guage tag in the English original-vs. -Japanese version 

10 HTML document, it sets an invisibility start tag <INVIS- 
IBILITY START> as a visibility control tag immediately 
before the detected language tag (S415). 
[0089] Thereafter, to enable visual display of only the 
Japanese sentences in the English original-vs. -Japa- 

15 nese version HTML document, the individual-language 
HTML document conversion processing section 16 
makes a determination as to whether or not the lan- 
guage tag <J> is detected (S416). If the individual-lan- 
guage HTML document conversion processing section 

20 1 6 has detected a language tag <J>, it sets an invisibility 
end tag INVISIBILITY END> as a visibility control tag 
immediately after the detected tag (immediately before 
a translated sentence) (S417). 
[0090] The well-known comment tag may be used as 

25 the above-mentioned visibility control tag. A sentence 
or paragraph bracketed by a pair of comment tags is not 
displayed by the WWW browser 4. 
[0091] If the individual-language HTML document 
conversion processing section 16 recognizes the de- 

so tected language tag as a language tag other than the 
language tag <J> in above step S41 6, it sets an invisi- 
bility end tag INVISIBILITY END> at the end of the par- 
agraph (S418). This process is effective in processing 
an original-versus-version HTML document containing 

35 sentences written in three or more languages. For ex- 
ample, if the original-versus-version HTML document 
contains German sentences (original) together with 
English sentences (original) and Japanese sentences 
(translated version), this process makes the German 

40 sentences invisible as well as the English sentences. 
[0092] An original-versus-version HTML document 
22 having invisibility tags INVISIBILITY START>, IN- 
VISIBILITY END> as visibility control tags added thereto 
as well as language tags <E> and <J>, as shown in Fig. 

45 2, is formed in this mannerto display Japanese sentenc- 
es only. Fig. 2 shows this HTML document with source 
codes. 

[0093] If no language tag is detected in above step 
S414, or after the completion of processing of the step 

50 S417 or 41 8, the individual-language HTML document 
conversion processing section 16 proceeds to process- 
ing of the next paragraph in the original-versus-version 
HTML document, and makes a determination as to 
whether or not an HTML tag </html> indicating the end 

55 of the HTML document has been detected (steps S41 9, 
S420). The individual-language HTML document con- 
version processing section 16 repeats the above-de- 
scribed steps S41 3 to S420 until the HTML tag indicat- 
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ing the end of the HTML document is detected. 
[0094] The individual-language HTML document con- 
version processing section 1 6 stores in the data storage 
section 1 7 the HTML document with invisibility tags ob- 
tained as a result of the above-described sequence of 
process steps for selective display of the original-ver- 
sus-version HTML document when the HTML tag indi- 
cating the end oftheoriginal-versus-version HTML doc- 
ument is detected. 

[0095] The display processing section 1 2 then opens 
the HTML document with invisibility tags by the WWW 
browser 4, and visually displays the document on the 
screen of the display section 11 (S421). A Japanese 
sentence HTML display 23 shown in Fig. 2 represents 
the state of this display on the screen. 
[0096] Further, Fig. 2 shows the source code for en- 
abling display of English sentences only in an original- 
versus-version HTML document 24, and shows an 
HTML display 25 which represents the state of display 
on the screen when the HTML document 24 is displayed 
by the WWW browser 4. 

[Second Embodiment] 

[Configuration of HTML Document Processing 
Apparatus] 

[0097] Fig. 7 shows functional components of an 
HTML document processing apparatus 1 which repre- 
sents a Embodiment 2 of the present invention. The 
HTML document processing apparatus 1 is connected 
to a World Wide Web (WWW) server 3 through an IP 
communication network, i.e., Internet 2. In this embod- 
iment, the WWW server 3 holds English home pages. 
[0098] The HTML document processing apparatus 1 
can be realized by a personal computer, and has an in- 
put section 10,adisplay section 11, a display processing 
section 12, a communication control section 13, an au- 
tomatic translation processing section 14, an original- 
versus-version HTML document forming processing 
section 15, an individual-language HTML document 
conversion processing section 16, a data storage sec- 
tion 17, an HTML document analysis and conversion 
processing section 18, an English-Japanese translation 
editor (original-versus-version editor program) 19, an 
original-versus-version HTML document conversion 
processing section 40, etc. The HTML document 
processing apparatus 1 is connected to a WWW brows- 
er 4 provided as an Internet display tool. 
[0099] In the HTML document processing apparatus 
arranged as described above, the automatic translation 
processing section 14 translates an English HTML doc- 
ument (an original including sentences and figures) of 
an English home page described in English into a Jap- 
anese HTM L document (a translated version) described 
in Japanese, by using the home page translation func- 
tions of the above-mentioned Internet translation soft- 
ware, and the original and the results of the translation 
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are visually displayed in such a form that each of pairs 
of original and translated paragraphs or sentences are 
juxtaposed in upper and lower positions. 
[0100] In this translation and display process, the orig- 

s inal-versus-version HTML document forming process- 
ing section 15 attaches a language tag, which is a spe- 
cial tag for determination of the kind of language, to the 
beginning of each of paragraphs or sentences which are 
constituent units of the original document and thetrans- 

10 lated document. 

[0101] The HTML document analysis and conversion 
processing section 1 8 and the English-Japanese trans- 
lation editor (original-versus-version editor program) 19 
extract data with respect to each language, when data 

15 is edited in an original-versus-version form. After editing 
of original-versus-version data performed by the editor, 
HTML tags are restored to form an HTML document 
from the edited data. 

[0102] If an HTML document is translated by Internet 
20 translation software, translation results outputted in an 
original-versus-version form cannot be edited. Howev- 
er, document data prepared by removing HTML tags 
from the translation results is supplied to the editor ca- 
pable of editing data in an original-versus-version form 
25 to enable editing of the translation results. Also, HTML 
tags are restored in edit results to enable the edit results 
to be used as an HTML document. 

[Operation of HTML Document Processing Apparatus] 

[0103] The operation of the HTML document process- 
ing apparatus 1 in Embodiment 2 of the present inven- 
tion shown in Fig. 7 will now be described. Figs. 8 and 
9 are flowcharts for explaining the operation of the 
35 HTML document processing apparatus 1 . 

(Original-versus-version HTML Document Forming 
Processing) 

40 [0104] Referring to Figs. 7 through 9, in the HTML 
document processing apparatus 1, an English HTML 
document (original) of an English home page described 
in English is translated, an HTML document including 
the formed original and translated version is analyzed, 

45 and an original-versus-version HTML document having 
language tags embedded therein is formed and visually 
displayed. This document forming and displaying 
processing is the same as that performed by the HTML 
document processing apparatus 1 Embodiment 1 de- 

so scribed above, i.e., the process steps S401 to S414 
shown in Fig. 4. 

(Original-versus-version HTML Document Data 
Conversion Processing and Translated Version Edit 
55 Processing) 

[0105] In the HTML document processing apparatus 
1 , after the above-described processing of forming an 



EP1 139 231 A1 



9 



BNSDOCID: <EP 1139231A1J_> 



17 



EP 1 139 231 A1 



18 



original-versus-version HTML document displayed in an 
upper-lower juxtaposition form on the screen of the dis- 
play section 1 1 , data conversion processing and trans- 
lated version edit processing can be performed in a 
process described below. s 
[0106] When a user clicks an "original-versus-version 
edit processing start" button, the automatic translation 
processing section 14 having translation software 
hands over processing to the HTML document analysis 
and conversion processing section 18 (S812 in Fig. 8) 10 
to enable editing of the Japanese HTML document (ver- 
sion) in the original-versus-version HTML document 20 
shown in Fig. 2. 

[01 07] The HTML document analysis and conversion 
processing section 1 8 starts processing for enabling ed- '5 
iting using the English-Japanese translation editor (orig- 
inal-versus-version editorprogram) 19 by removing lan- 
guage tags from the original-versus-version HTML doc- 
ument source 21 having language tags <E> and <J> 
added thereto (S81 3). 20 
[0108] First, the HTML document analysis and con- 
version processing section 18 makes a determination 
as to whether a language tag <E> or <J> embedded in 
the original-versus-version HTML document in the orig- 
inal-versus-version HTML document forming process- 25 
ing section 15 has been detected (S814). 
[0109] If the HTML document analysis and conver- 
sion processing section 1 8 has detected a language tag 
<E>or<J> in the English original-versus-Japanese ver- 
sion HTML document, it stores in the data storage sec- so 
tion 17 language data together with language tag start 
and end addresses (S815). 

[0110] Thereafter, to enable display and edit of Eng- 
lish sentences in the English original-vs.-Japanese ver- 
sion HTML document, the HTML document analysis 35 
and conversion processing section 1 8 makes a deter- 
mination as to w h eth er o r n ot th e detected I an g u age tag 
is a language tag <E> (S81 6). If the detected language 
tag is a language tag <E>, the HTML document analysis 
and conversion processing section 1 8 stores in an orig- 
inal document storage area in the data storage section 
1 7, the start address of a document constituent unit par- 
agraph (or sentence) and the contents of the paragraph 

(5817) . 

[0111] If the result of above determination in step 
S816 detects a language tag <J>, the HTML document 
analysis and conversion processing section 18 stores 
in a version storage area in the data storage section 1 7, 
the start address of a document constituent unit para- 
graph (or sentence) and the contents of the paragraph 

(581 8) to enable Japanese sentence display and edit. 
As a result of the above process steps S815 to S818, 
English and Japanese paragraphs or sentences from 
which HTML tags have been removed and the start and 
end addresses of the paragraphs or sentences are 
stored in the data storage section 1 7. 

[0112] If no language tag is detected in above step 
S814, or after the completion of processing in above 



step S817 or 818, the HTML document analysis and 
conversion processing section 1 8 proceeds to process- 
ing of the next paragraph in the original-versus-version 
HTML document 21 , and makes a determination as to 
whether or not an HTML tag </html> indicating the end 
of the HTML document is detected (steps S81 9, S820). 
[0113] The HTML document analysis and conversion 
processing section 18 repeats the above-described 
steps S81 3 to S820 in cooperation with the English-Jap- 
anese translation editor (original-versus-version editor 
program) 19, until the HTML tag indicating the end of 
the HTML document is detected. Fig. 10 shows an ex- 
ample of an initial window of the English-Japanese 
translation editor. 

[0114] When the HTML tag indicating the end of the 
original-versus-version HTML document is detected, 
the HTML document analysis and conversion process- 
ing section 18 visually displays (S821), in the window of 
the English- Japanese translation editor (display section 
11), the results of the above-described sequence of 
process steps for converting the original-versus-version 
HTML document data, i.e., English paragraphs or sen- 
tences (original) and Japanese paragraphs or sentenc- 
es (translated version) from which the language tags 
and the HTML tags have been removed. 
[0115] The user edits the Japanese sentences ac- 
cording to his or her need on the basis of the English 
and Japanese paragraphs or sentences visually dis- 
played in the left-right juxtaposition form in the window 
of the English-Japanese translation editor. The symbol 
• in the translated sentences shown in Fig. 3 designates 
edited portions. When the user clicks an "end" button in 
the English-Japanese translation editor window after 
suitably editing the Japanese version, the English-Jap- 
anese translation editor (original-versus-version editor 
program) 1 9 stores the results of editing in the data stor- 
age section 1 7, thereby completing the edit processing 
(S822). 



[0116] In the HTML document processing apparatus 
1 , after the above-described original-versus-version 
HTM L document data conversion processing and trans- 
lated version editing processing, a process described 
below is further performed to enable restoration to the 
HTML document format. 

[01 1 7] When the user clicks an "HTML display" button 
in the English-Japanese translation editor window, the 
English-Japanese translation editor (original-versus- 
version editor program) 19 is started (S823). By an in- 
struction from the English-Japanese translation editor 
1 9, the original-versus-version HTML document conver- 
sion processing section 40 starts analyzing the edited 
document (S824). 

[0118] First, the original-versus-version HTML docu- 
ment conversion processing section 40 reads out one 



40 (Processing for Restoration to HTML Document 
Format) 
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of the English paragraphs or sentences (original) or the 
Japanese paragraphs or sentences (translated version) 
stored in the data storage section 17, and makes a de- 
termination as to whetherthe document constituent unit, 
i.e., the paragraph or sentence, is English (original) or s 
Japanese (translated version) (S825). This determina- 
tion can be definitely made based on the language tags 
stored in the data storage section 1 7. 
[01 19] If the result of this determination is that the par- 
agraph or sentence read out is English, the original-ver- '° 
sus-version HTML document conversion processing 
section 40 adds the language tag <E> to the beginning 
of the paragraph (S826). If the paragraph read out is 
Japanese, the original-versus-version HTML document 
conversion processing section 40 adds the language 15 
tag <J> to the beginning of the paragraph (S827). The 
original-versus-version HTML document conversion 
processing section 40 substitutes (S828) the paragraph 
of the edited document for the corresponding paragraph 
of the original-versus-version HTML document before 20 
editing by referring to the paragraph start address stored 
in the data storage section 17. 

[0120] The original-versus-version HTML document 
conversion processing section 40 then proceeds to 
processing of the next paragraph and makes a determi- 25 
nation as to whether or not the end of the edited docu- 
ment has been reached (steps S829, S830). The origi- 
nal-versus-version HTML document conversion 
processing section 40 repeats the above-described 
steps S824 to S830 until the end of the edited document 30 
is reached. 

[0121] The original-versus-version HTML document 
conversion processing section 40 stores in the data stor- 
age section 17 the original-versus-version HTML docu- 
ment obtained as a result of the above-described se- 35 
quence of HTML document restoration processing 
steps. The display processing section 1 2 opens, by the 
WWW browser 4, the original-versus-version HTML 
document stored in the data storage section 17, thereby 
making a display reflecting the editing results (S831). 40 
[0122] When the original-versus-version HTML docu- 
ment is displayed on the editor, the English paragraphs 
or sentences and the positions of the same, and the Jap- 
anese paragraphs or sentences and the positions of the 
same in the original-versus-version HTML document 45 
are stored in the respective storage areas in the data 
storage section 17, and the original-versus-version 
HTML document before editing is also held by the data 
storage section 17, From the information on the posi- 
tions of the English and Japanese paragraphs or sen- so 
tences stored in the storage areas, the other portions 
can be identified as the tags, in this manner, the posi- 
tions of the tags can be obtained reciprocally from the 
positions of the English and Japanese paragraphs or 
sentences, and HTMLdocument restoration processing 55 
can be performed by using the tags in the original-ver- 
sus-version HTML document before editing with refer- 
ence to the tag position information obtained from the 



paragraph or sentence positions. 
[0123] The formed original-versus-version HTML 
document can be displayed in the form of an original- 
versus-version home page by the Internet display tool 
WWW browser4 if it may formed without invisibility tags. 
Selective display processing (see Fig. 5) of the original- 
versus-version HTML document may be performed by 
the individual-language HTML document conversion 
processing section 1 6 if necessary, as in the HTML doc- 
ument processing apparatus 1 in above-described Em- 
bodiment 1 , thereby enabling only the Japanese HTML 
document (version) obtained by this processing to be 
visually displayed by the display section 1 1 . 

[Embodiment 3] 

[Configuration of HTML Document Processing 
Apparatus] 

[0124] An HTML document processing apparatus 1 
which represents Embodiment 3 of the present inven- 
tion has basically the same functions and configuration 
as the HTML document processing apparatus 1 of Em- 
bodiment 2 shown in Fig. 7. 

[0125] In the HTML document processing apparatus 
1 in the third embodiment, however, there is no need for 
the automatic translation processing section 1 4 and the 
original-versus-version HTML document forming 
processing section 15. 

[0126] In this HTML document processing apparatus 
1, the HTML document analysis and conversion 
processing section 18 and the English-Japanese trans- 
lation editor (original-versus-version editor program) 19 
cooperate to perform data conversion by removing 
HTML tags from an English HTML document (original) 
of an English home page described in English, thereby 
enabling editing of the original and inputting of a trans- 
lated version. A user edits the editable English sentenc- 
es according to his or her need and inputs Japanese 
sentences (translated version). 

[0127] Also, the original-versus-version HTML docu- 
ment conversion processing section 40 substitutes ed- 
ited English and Japanese sentences for the original 
English sentences, restores the removed HTML tags, 
and sets translation-result Japanese sentences (trans- 
lated version) in correspondence with the edited sen- 
tences, thereby enabling visual display of an original- 
versus-version HTML document containing a more fa- 
vorable version. 

[Operation of HTML Document Processing Apparatus] 

(Data Conversion Processing and Original Edit 
Processing of English HTML Document (Original)) 

[0128] Fig. 7 shows the configuration of this HTML 
document processing apparatus 1 , and Figs. 11 and 12 
are flowcharts for explaining the operation of this HTML 
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document processing apparatus 1 . Referring to Figs. 7, 
1 1 , and 1 2, a user using the HTML document processing 
apparatus 1 searches the Internet for a home page de- 
scribed in a language which he or she wishes to trans- 
late, by operating the input section 1 0 and based on the 
conlents or a display made by the display section 1 1 . In 
this example, the object of this search (processing step 
S1101 in Fig. 11) is an English home page held by the 
WWW server 3. 

[0129] Next, the user initiates a program for transla- 
tion in the desired direction by operating the input sec- 
tion 10 and based on the contents of the display made 
by the display section 11 . In this example, the English 
home page is to be finally translated into a Japanese 
HTML document described in Japanese. Therefore, an 
English-into-Japanese translation program is started 
(S1102). 

[0130] The user clicks an "original-versus-version edit 
processing start" button in an initial window (dialogue 
window) (not shown) of the translation program to start 
the English-Japanese translation editor (original-ver- 
sus-version editor program) 19 (S1103). 
[01 31 ] The HTML document analysis and conversion 
processing section 1 8 starts analyzing (S1 1 04) the Eng- 
lish HTML document (original) supplied from the WWW 
server 3 through the communication control section 13. 
First, the HTML document analysis and conversion 
processing section 18 recognizes a text portion of the 
English HTML document (original). The HTML docu- 
ment analysis and conversion processing section 1 8 de- 
tects an HTML tag to recognizethe section following this 
tag as an English text (S1105). 
[0132] Next, the HTML document analysis and con- 
version processing section 1 8 stores in the original stor- 
age area in the data storage section 1 7 the start address 
of a document constituent unit paragraph (or sentence) 
and the contents of the paragraph (S1106). As a result 
of the processing in the above steps S1105 and S1106, 
the English paragraph or sentence from which the 
HTML tag has been removed is stored in the data stor- 
age section 1 7. 

[01 33] If HTM L tag is detected in above step S1 1 05, 
or after the completion of processing in the step S11 06, 
the HTML document analysis and conversion process- 
ing section 1 8 proceeds to processing of the next para- 
graph in the English HTML document, and makes a de- 
termination as to whether or not an HTML tag </html> 
indicating the end of the HTML document is detected 
(steps S1 107, S1108). 

[0134] The HTML document analysis and conversion 
processing section 18 repeats the above-described 
steps S1104 to S1108 in cooperation with the English- 
Japanese translation editor (original-versus-version ed- 
itor program) 19 until the HTML tag indicating the end 
of the English HTML document is detected. 
[0135] When the HTML tag indicating the end of the 
English HTML document is detected, the HTML docu- 
ment analysis and conversion processing section 18 
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visually displays (S1109) in a left-hand section of the 
window of the English-Japanese translation editor (dis- 
play section 11), the results of the above-described se- 
quence of process steps for converting the English 

5 HTML document data. 

[0136] The user performs manual translation on the 
basis of the English paragraphs or sentences (original) 
visually displayed in the window of the English-Japa- 
nese translation editor, and forms translated sentences 

10 (Japanese sentences) by operating the input section 10. 
In this translation processing, the user edits the English 
sentences and, if necessary the Japanese translated 
sentences according to his or her need. When the user 
clicks an "end" button in the English-Japanese transla- 

15 tion editor window after suitably editing the original and 
completing the Japanese version, the English-Japa- 
nese translation editor (original-versus-version editor 
program) 1 9 stores the results of editing and translation 
in the data storage section 17, thereby completing the 

20 processing (S1110). 

(Processing for Restoration to HTML Document 
Format) 

25 [0137] In the HTML document processing apparatus 
1 , after the above-described English HTML document 
(original) data conversion processing and original edit- 
ing processing, a process described below is next per- 
formed to enable restoration processing of the HTML 

30 document format. 

[0138] When the userclicks an "HTMLdisplay" button 
in the English-Japanese translation editor window, the 
English-Japanese translation editor (original-versus- 
version editor program) 19 is again started (S1111). By 

35 an instruction from the English-Japanese translation ed- 
itor 19, the original-versus-version HTML document 
conversion processing section 40 starts analyzing the 
resulting edited document (S1112). 
[0139] First, the original-versus-version HTML docu- 

40 ment conversion processing section 40 reads out one 
of the English paragraphs or sentences (original) stored 
in the data storage section 1 7, and makes a determina- 
tion as to whether or not the document constituent unit, 
i.e., the paragraph (or sentence), is English (original) 

45 (S1113). This determination can be definitely made 
based on the storage positions stored in the left-right 
juxtaposition form in the data storage section 17. 
[0140] If the result of this determination is that the par- 
agraph or sentence read out is English, the original-ver- 
so sus-version HTML document conversion processing 
section 40 adds the language tag <E> to the beginning 
of the paragraph (S1114). If the paragraph read out is 
Japanese, the original-versus-version HTML document 
conversion processing section 40 adds the language 

55 tag <J> to the beginning of the paragraph (S1115). 
[0141] The original-versus-version HTML document 
conversion processing section 40 substitutes (S1116) 
the paragraph of the resulting edited original with the 
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language tag added thereto for the corresponding orig- 
inal paragraph of the HTML document before editing, by 
referring to the paragraph start address stored in the da- 
ta storage section 17, and inserts the translated sen- 
tence formed in the above-described step S1110 imme- 
diately after the substituted original paragraph (S11 1 7). 
[0142] The original-versus-version HTML document 
conversion processing section 40 then proceeds to 
processing of the next paragraph and makes a determi- 
nation as to whether or not the end of the edited docu- 
ment has been reached (steps S1118, S1119). The orig- 
inal-versus-version HTML document conversion 
processing section 40 repeats the above-described 
steps S1112 to S1119 until the end of the edited docu- 
ment is reached. 

[0143] The original-versus-version HTML document 
conversion processing section 40 stores in the data stor- 
age section 17 the original-versus-version HTML docu- 
ment obtained as a result of the above-described se- 
quence of HTML document restoration processing 
steps. The display processing section 12 then opens, 
by the WWW browser 4, the original-versus-version 
HTML document stored in the data storage section 17, 
thereby making a display reflecting the editing results 
(S1120) . 

[0144] Also in this embodiment, HTML document res- 
toration processing can be performed by using the tags 
in the original-versus-version HTML document in the 
same manner as that in Embodiment 2. 
[0145] Also, selective display processing (see Fig. 5) 
of the original-versus-version HTML document may be 
performed by the individual-language HTML document 
conversion processing section 16 if necessary, as in the 
HTML document processing apparatus 1 in the above- 
described Embodiment 1, thereby enabling only the 
Japanese HTML document (translated version) ob- 
tained by this processing to be visually displayed in the 
display section 11 . 

[Examples of Modification] 

[0146] In the HTML document processing apparatus 
1 in the above-described third embodiment, translated 
sentences are formed by manual translation. However, 
an automatic translation process using the automatic 
translation processing section 14 may also be per- 
formed in the third embodiment, similar to that in the 
HTML document processing apparatuses in the first and 
second embodiments. Conversely, manual translation 
may be performed in the HTML document processing 
apparatuses in the first and second embodiments. 
[0147] In each of the HTML document processing ap- 
paratuses in the above-described embodiments, an 
original written in English, is translated into Japanese. 
However, the HTML document processing apparatus of 
the present invention can also operate in the same man- 
ner with respect to the respective languages, and can 
similarly process an HTML document having sentences 



written in three or more languages. 
[0148] In the HTML document processing apparatus 
1 in the first embodiment, invisibility tags designating in- 
hibition of visual display are set as visibility control tags 

s in an HTML document. However, it is also possible to 
print only a document expressed in a particular lan- 
guage selected by setting such tags. 
[0149] Each of the above-described processes ac- 
cording to the present invention can be applied in asso- 

10 ciation with a computer-readable medium. 

[0150] Although only a few embodiments of the 
present invention have been described in detail above, 
those skilled in the art will readily appreciate that many 
modifications are possible in the preferred embodi- 

15 ments without departing from the novel teachings and 
advantages of the present invention. Accordingly, all 
such modifications are intended to be included within 
the scope of the present invention as defined by the fol- 
lowing claims. 



1 . A document processing apparatus comprising: 

language tag setting means for setting a lan- 
guage tag designating a kind of language at 
each of constituent unit positions in an original 
andthose in atranslated version corresponding 
to the original; and 

document forming means for forming a tagged 
document including the original and the trans- 
lated version each having the language tag set 
therein. 

2. A document processing apparatus according to 
claim 1, wherein said document forming means 
forms a tagged document in which the original and 
the translated version each having the language tag 
set therein arc described in an original-versus-ver- 
sion form. 

3. A document processing apparatus according to 
claim 1 or 2, further comprising translation means 
for translating the original to make the translated 



A document processing apparatus according to any 
one of claims 1 to 3, wherein the original is con- 
tained in the processing-object tagged document. 

A document processing apparatus according to any 
one of claims 1 to 4, furthercomprising visibility con- 
trol tag setting means for setting a visibility control 
tag for controlling any document portion so that the 
document portion is invisible, said visibility control 
tag setting means setting the visibility control tag at 
such a position that one of the original and the trans- 
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lated version is in an invisible state. 13. A document processing apparatus comprising: 



6. A document processing apparatus according to 
claim 5, wherein said visibility control tag setting 
means setting the visibility control tag at such a po- 
sition that one of the original and Ihe translated ver- 
sion is set in an invisible state and the language tag 
is also set in an invisible state. 

7. A document processing apparatus according to 
claim 1, further comprising display processing 
means for interpreting the tag contained in the 
tagged document and for enabling a browser for 
displaying the tagged document to display the 
formed tagged document on the basis of a result of 
the interpretation. 

8. A document processing apparatus comprising: 

language tag setting means for setting a lan- 
guage tag at each of constituent unit positions 
in a first text described in a first language and 
those in a second text described in a second 
language of the first text; and 
document forming means for forming a tagged 
document including the firsttext and the second 
text each having the language tag set therein. 

9. A document processing apparatus according to 
claim 8, wherein said document forming means 
forms a tagged document in which the first text and 
the corresponding second text each having the lan- 
guage tag set therein are described by being related 
to each other. 

10. A document processing apparatus according to 
claim 8, further comprising visibility control tag set- 
ting means for setting a visibility control tag for con- 
trolling a text so that the text is invisible, said visi- 
bility control tag setting means setting the visibility 
control tag at such a position that one of the first text 
and the second text is in an invisible state. 



analysis means for determining and extracting 
an original text from a processing-object 
5 tagged document; and 

editing means for enabling edit processing of 
the original text extracted from the tagged doc- 
ument by displaying the original text. 

10 14. A document processing apparatus according to 
claim 13, wherein said analysis means determines 
the original text according to a language tag con- 
tained in the processing-object tagged document. 

15 15. A document processing apparatus according to 
claim 13 or 14, further comprising translation 
processing means for making a translated version 
by translating the original text edited by said editing 
means. 

16. A document processing apparatus according to 
claim 15, further comprising restoration means for 
restoring, from the original text after the editing and 
the translated version made by said translation 

25 processing means, a tagged document in the same 
format as the processing-object tagged document, 
according to tags contained in the processing-ob- 
ject tagged document, said tags including the lan- 
guage tag. 

30 

17. A document processing apparatus comprising: 

analysis means for determining an original text 
and a translated version corresponding to the 
35 original text in a processing-object tagged doc- 

ument; and 

editing means for enabling editing of the origi- 
nal text and the translated version by extracting 
the original text and the translated version from 
40 the tagged document and by displaying the 

original text and the translated version in an 
original-versus-version form. 



11. A document processing apparatus according to 
claim 10, wherein said visibility control tag setting 
means sets the visibility control tag at such a posi- 
tion that one of the first text and the second text is 
set in an invisible state and the language tag is also 
set in an invisible state. 

12. A document processing apparatus according to 
claim 8, further comprising display processing 
means for interpreting the tag contained in the 
tagged document and for enabling a browser for 
displaying the tagged document to display the 
formed tagged document on the basis of a result of 
the interpretation. 



18. A document processing apparatus according to 
45 claim 17, wherein said analysis means determines 
the original text and the translated version accord- 
ing to language tags contained in the processing- 
object tagged document. 

50 19. A document processing apparatus according to 
claim 17 or 18, further comprising translation 
processing means for making a translated version 
by translating the original text edited by said editing 
means. 

55 

20. A document processing apparatus according to 
claim 1 9, further comprising restoration means for 
restoring, from the original text after the edit 
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processing and the translated version made by said 
translation processing means, a tagged document 
in the same format as the processing-object tagged 
document, according to tags contained in the 
processing-object tagged document, said tags in- 
cluding the language tag. 

21. A recording medium readable by a computer, said 
recording medium having a program recorded ther- 
eon, said program for enabling the computer to ex- 
ecute steps of: 

setting a language tag designating a kind of lan- 
guage at each of constituent unit positions in 
an original and those in a translated version 
corresponding to the original; and 
forming a tagged document including the orig- 
inal and the translated version each having the 
language tag set therein. 

22. A recording medium accordingto claim 21 , wherein 
said program recorded on said recording medium 
enables the computer to further execute a step of 
setting a visibility control tag for controlling any doc- 
ument portion so that the document portion is invis- 
ible, said visibility control tag being set at such a 
position that one of the original and the translated 
version is in an invisible state. 

23. A recording medium readable by a computer, said 
recording medium having a program recorded ther- 
eon, said program for enabling the computer to ex- 
ecute steps of: 

setting a language tag designating a kind of lan- 
guage at each of constituent unit positions in a 
first text described in a first language and those 
in a second text described in a second lan- 
guage of the first text; and 
forming a tagged document including the first 
text and the second text each having the lan- 
guage tag set therein. 

24. A recording medium readable by a computer ac- 
cording to claim 23, wherein said program recorded 
on said recording medium enables the computer to 
further execute a step of setting a visibility control 
tag for controlling a text so that the document is in- 
visible, said visibility control tag being set at such a 
position that one of the first text and the second text 
is in an invisible state. 



a processing-object tagged document; and 
enabling editing of the original text extracted 
from the tagged document by displaying the 
original text. 

26. A recording medium readable by a computer ac- 
cording to claim 25, wherein said program recorded 
on said recording medium enables the computer to 
further execute steps of: 

making a translated version by translating the 
original text after editing of the original text; and 
restoring, from the original text after the edit 
processing and the translated version formed 
by said translation, a tagged document in the 
same format as the processing-object tagged 
document according to tags contained in the 
processing-object tagged document, said tags 
including a language tag designating a kind of 



27. A recording medium readable by a computer, said 
recording medium having a program recorded ther- 
eon, said program for enabling the computer to ex- 
ecute steps of: 

determining an original text and a translated 
version corresponding to the original text in a 
processing-object tagged document; and 
enabling edit processing of the original text and 
the translated version by extracting the original 
text and the translated version from the tagged 
document and by displaying the original text 
and the translated version in an original-ver- 
sus-version form. 

28. A recording medium readable by a computer ac- 
cording to claim 27, wherein said program recorded 
on said recording medium enables the computer to 
further execute steps of: 



forming a translated version by tr 
original text after editing of the original text; and 
restoring, from the original text after the edit 
processing and thetranslated version made by 
said translation, a tagged document in the 
same format as the processing-object tagged 
document, according to tags contained in the 
processing-object tagged document, said tags 
including a language tag designating a kind of 
language. 



25. A recording medium readable by a computer, said 29. A recording medium readable by a computer, s 



recording medium having a program recorded ther- 
eon, said program for enabling the computer to ex- 
ecute steps of: 

determining and extracting an original text from 



recording medium having a tagged document re- 
corded thereon, wherein said recorded tagged doc- 
ument comprising: 

a first language tag designating a first kind of 
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language; 

a first text following said first language tag, said 
first text being described in a first language; 
a second language tag following said first text, 
said second language tag designating a sec- 
ond kind of language; and 
a second text following said second language 
tag, said second text being described in a sec- 
ond language, said second text corresponding 
to said first text. 

30. A recording medium readable by a computer, said 
recording medium having a tagged document re- 
corded thereon, wherein said recorded tagged doc- 
ument comprising: 

aninvisibiiitystarttagforsettingafollowingtext 
in an invisible state; 

a first language tag following said invisibility 
start tag, said first language tag designating a 
first kind of language; 

a first text following said first language tag, said 
first text being described in a first language; 
a second language tag following said first text, 
said second language tag designating a sec- 
ond kind of language; 

an invisibility endtag following said second lan- 
guage tag, said invisibility end tag canceling the 
invisible state; and 

a second text following said invisibility endtag, so 
said second text being described in a second 
language, said second text corresponding to 
said first text. 

31 . A document processing method comprising steps 35 



first text described in afirst language and those 
in a second text described in a second lan- 
guage of the first text; and 
forming a tagged document including the first 
text and the second text each having the lan- 
guage tag set therein. 

34. A document processing method according to claim 
33, wherein further comprising a step of setting a 
visibility control tag for controlling a text so that the 
document is invisible, said visibility control tag being 
set at such a position that one of the first text and 
the second text is in an invisible state. 

35. A document processing method comprising steps 



setting a language tag designating a kind of lan- 
guage at each of constituent unit positions in 
an original and those in a translated version 
corresponding to the original; and 
forming a tagged document including the orig- 
inal and the translated version each having the 
language tag set therein. 

32. A document processing method according to claim 
31 , wherein further comprising a step of setting a 
visibility control tag for controlling any document 
portion so that the document portion is invisible, 
said visibility control tag being set at such a position 
that one of the original and the translated version is 
in an invisible state. 

33. A document processing method comprising steps 



setting a language tag designating a kind of lan- 
guage at each of constituent unit positions in a 



determining and extracting an original text from 
a processing-object tagged document; and 
enabling editing of the original text extracted 
from the tagged document by displaying the 
original text. 

36. A document processing method according to claim 
35, wherein further comprising steps of: 

making a translated version by translating the 
original text after editing of the original text; and 
restoring, from the original text after the edit 
processing and the translated version formed 
by said translation, a tagged document in the 
same format as the processing-object tagged 
document according to tags contained in the 
processing-object tagged document, said tags 
including a language tag designating a kind of 
language. 

37. A document processing method comprising steps 



determining an original text and a translated 
version corresponding to the original text in a 
processing-object tagged document; and 
enabling edit processing of the original text and 
the translated version by extracting the original 
text and the translated version from the tagged 
document and by displaying the original text 
and the translated version in an original-ver- 
sus-version form. 

38. A document processing method according to claim 
37, wherein further comprising steps of: 

forming a translated version by translating the 
original text after editing of the original text; and 
restoring, from the original text after the edit 
processing and thetranslated version made by 
said translation, a tagged document in the 
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same format as the processing-object tagged 
document, according to tags contained in the 
processing-object tagged document, said tags 
including a language tag designating a kind of 
language. 
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FIG.l 
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FIG.4 
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FIG.5 
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FIG.6 



<! DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 TRANSITIONAL //EN"> 
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</HEAD> 
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FIG.8 
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FIG.9 
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FIG.10 
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